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1. INTRODUCTION 

Nowadays artificial intelligence becomes one of the defining start technologies, which is being 
worked on in a significant number of areas with regard to the possibility of its implementation. According to 
the research with P. Boucher at the front, artificial intelligence is defined as a system that simulates human 
behavior by analyzing the environment and performing functions, with a certain degree of autonomy, to 
achieve pre-formed goals [1]. Thus, artificial intelligence is a systemic definition that includes machine and 
deep learning, peculiarities of which are revealed while artificial intelligence models’ learning. 

The study provides a scientific basis for the application of artificial intelligence technology to 
determine the professional adaptive capabilities of construction management staff. If we base the 
development of an artificial intelligence information system for multi-value classification on the results of 
youth vocational guidance tests, it will improve the diagnosis of professional selection and will be able to 
provide recommendations for improving the productivity of professional implementation [2]. The level of 
trust of young people in the artificial intelligence when it comes to the questions of career guidance, choosing 
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a professional direction or profession according to their wishes, aptitudes, capabilities, and so on, is then 
determined. 

Ukrainian scientists Jaber and Palamarchuk analyzed the introduction of artificial intelligence 
elements in organizing and processing the results of professional orientation. In their work [3], they 
suggested a structure and methodology for constructing an intellectual system of the vocational guidance for 
the youth. Polish scientists Szymanski et al. [4] study the multi-label classification process. The paper [4] 
presents a multi-value classification for solving several or multiple output values. Scientists note that such a 
classification should be applied in system cases that have multiple input and output classes, the classification 
of which, respectively, is necessary. 

Scientists from Nigeria Owoseni ef al. [5] established the usefulness of Adaptive neuro-fuzzy 
inference system in predicting outcomes of events, processes or systems from their numerical datasets. The 
authors described mathematical details and proposed an improved Adaptive neuro-fuzzy inference system 
that uses an agglomerative-based clustering ensemble of fuzzy c-means to help extract rules from a given 
dataset that represent a system. The work [6] of Turkish scientists Acun ef al. states that the practical 
implementation of the decision tree for machine learning has much more nuances due to the significant 
number of input and output parameters, which will lead to poor predictability due to weak data normalcy. In 
research papers [7]-[11], the authors propose a multi-label classification that predicts the high efficiency of 
the neural network learning model and the quality of the output classifications in large-scale projects. 
Accordingly, each multi-value classification task contains its stratification by several parameters and its 
quality indicators. It also manages a data set with several parameters, performs regression and training by 
several parameters, and can represent the parameter space [12]. 

Shin and Karniadakis [13] suggests that machine learning with a teacher has a certain list of input 
data that a specialist wishes to use in an artificial intelligence model to denote the initial information. The 
paper [14] notes that machine learning without a teacher has only input data, where the model forms its 
structure. Research papers [15]-[17] explore the regression of machine learning applied in the traditional 
determination of cause-effect relationships between variables. Thus, the standard category of machine 
learning with a teacher is a classification task, which is one of the most popular methods for determining 
cause-effect relationships between variables in the form of a decision tree. 

Ojokoh et al. [18] applied big data technologies, analytics and artificial intelligence to accomplish 
complex tasks at levels beyond human skills. They proposed the use of artificial intelligence-based ensemble 
learning technique for validating the engineering properties of material. Sarumi et al. [19] developed an 
efficient intrusion model for preventing network intrusion attacks in real-life application scenarios using data 
analytics and machine intelligence. Few researchers attempted optimizing machine learning algorithms in 
[20]-[22] using open-source systems. Research papers [20] and [23] suggest to solving the problem to 
determine the professional adaptive capabilities of construction management based on a multidimensional 
data model. The studies [24] and [25] are devoted to the use of fuzzy logic methods to assess the impact of 
adaptability factors on the effectiveness of change management at construction industry enterprises. 

This study aims to develop a model of artificial intelligence that will solve the problem of multi- 
label classification of determining the professional adaptive capabilities of construction management staff. 
To achieve the research goal, it is necessary to investigate aspects of neural networks and their multi-value 
classification, develop a fully connected feed-forward neural network (FCF-FNN) architecture and perform 
empirical modeling to create a data set. 


2. THE MAIN RESEARCH 

Having analyzed the works and studies of Ukrainian and foreign scientists, we have developed a 
model of an FCF-FNN to perform the multi-value classification. Figure | shows the architecture of the 
developed FCF-FNN model. A neuron consists of synapses that provide communication between the inputs 
and the neuron kernel, which, in turn, processes input signals and uses the Axon to transmit the processed 
information to the output of the neuron, that is, to the neuron of another layer. The significance of input flows 
is adjusted through synapses that correct the weight that influences the state of the neuron’s kernels. Equation 
for determining the state of the neuron kernel, 


S = Vie % + i (1) 


where n is number of input neurons; x; is value of the corresponding i” neuron; w, is weight of the 


corresponding i’” synapse. 
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Equation for determining the value of the output neuron (Axon), 
r= FQ) (2) 


where F is activation function of input neuron processing (Axon). 
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Figure 1. The architecture of an FCF-FNN model 


The study uses the following activation functions that are suitable for multi-value classification: 
rectified linear unit (ReLU) and Softmax. The ReLU activation function allows for the optimal and 
productive use of resources displayed in the minimal involvement of neurons that have a simple derivative, 
where all negative values are 0 and positive 1. Shin and Karniadakis recognize one specific issue with using 
this activation feature, which is called “The Dying ReLU”. This problem occurs when ReLU neurons become 
inactive and only output the value 0 for any input neuron [13]. This indicates that the neuron is dying, 
depriving the model of further learning and reducing the accuracy of the classification results. Figure 2 shows 
a diagram of the ReLU activation function. 


Figure 2. Diagram of the ReLU activation function 


The Softmax activation function is usually used in the multi-value classification tasks. The 
advantage of using this function is that the sum of all input neurons will be equal to 1. Thus, the output 
values for the function are in the range from 0 to 1, which correspond to each output neuron, which is ranked 
by the probability of output, and whose highest probability, by default, is used in the program. The overall 
result of this function is a comprehensive prediction of all output neurons, considering the patterns of input 
neurons. Equation for determining the value of neurons of the softmax group, 
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es 


Vi = 5K oh (3) 


where j = 1,...,K and determines the number of the output neuron; 1}, e7 is adder of all neurons of softmax 
group; e” is exponent of the output value of the neuron softmax group. 

The basis of the neural network model is an forward neural network (FNN) multilayer perceptron, 
one of the basic fundamental architectures of neural networks, occurring when the signals are propagated 
exclusively from input layers to output. This architecture confirms the complete connection of the neuron to 
all neurons of the previous layer. Based on the research of Miljanovic [9], he chosen architecture is 
straightforward enough, which is determined by the transfer of information from inputs to outputs without the 
use of cycles. 

The study suggests referring to the classical Titanic problem, which forms a decision tree according 
to the classification and prediction of smaller subgroups. The work of Turkish scientists Ekinci et al. [6] 
states that the practical implementation of the decision tree for machine learning has much more nuances due 
to the significant number of input and output parameters, which will lead to poor predictability due to weak 
data normalcy. 

Thus, the classic Titanic problem, based on the ticket price, age, class, sex, and several other 
parameters, allows you to determine the probability of a person’s survival. The Problem is solved by 
predictive analysis. Kakde and Agrawal claim its essence lies in using computational methods that 
characterize the pattern in large amounts of data [16]. 

Therefore, it is logical that the prediction of survival in the Titanic problem is performed on 
different combinations of static features. That is why, to implement artificial intelligence in the organization 
and process the vocational guidance for secondary school students, the data analysis model is previously 
considered. This model comprises categories with the input and output data on which the artificial 
intelligence model will be taught. Figure 3 shows the model and categories of artificial intelligence data 
analysis. Accordingly, a quantitative sample of respondents sets a threshold value for a set of tests for 
training the model. Input and output descriptions are normalized and partially presented in Table 1. It is 
worth noting that the list of professional areas has a primary source-the National Career system of Great 
Britain [14], which was significantly adapted for the study needs. 


Data 
analysis 


Description of data 


analysis 


Figure 3. Model and categories of artificial intelligence data analysis research 


As part of the study, the training data set is formed by a comprehensive survey. To train, the model 
uses the results of a survey. The respondents have already worked in a particular professional field they find 
interesting. Thus, an empirical study reveals the normalizability of the sample for training the machine learning 
model, as well as for further classification and forecasting the outcomes for those respondents who have not yet 
decided on a professional direction. The information about them will also be obtained from the survey. 

A functional model based on the IDEFO methodology has been developed to describe the operation 
of the information system and the processes taking place within it. Figure 4 shows diagram for model of 
FCF-FNN, which allows evaluating the processes occurring in an FCF-FNN during the execution of 
multi-value classification. The FCF-FNN model has a comprehensive description of the information system. 
Input arrow “SCV file” is a set of information type int64 about a person undergoing professional orientation. 

The output data of the system is: 

a. An SCV file containing add-ons by the performed multi-value classification by professional areas and 
can be exported for use in other information systems; 
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b. DataFrame is a table that saves the data and allows for viewing processed data and data for further 
processing; 
c. An Hb5 file is a file of an artificial intelligence model that has undergone appropriate training and can be 


exported for implementation to another information system. 


Table 1. Description and characteristics of machine learning model data 


Description and characteristics of machine learning model data 


Noda Name of the data i nae 
parameter parameter Description Characteristics 
Age 
0 <—-—- <1, wh A ds 
1 Age Respondent's age (full years) max_Age Wer eee Eee 
to the value of the maximum age of respondents 
AT 
10 AT Percentage of self-confidence that takes the  O< <1, when max_q corresponds to the 
maximum value of 100 max_q 
maximum value of 100 
TT2 
ll TT? Percentage of caution that takes the O< <1, when max_g corresponds to the 
maximum value of 100 max_q 
maximum value of 100 
; Et oi at = RPT 
Indicator of a realistic professional O< <1, when max_PT corresponds to 
12 RPT personality type that takes the maximum max_ PT 
value of 14 the maximum value of 14 
i i si IPT 
Indicator of an intellectual professional < <1, when max_PT corresponds to 
13 IPT personality type that takes the maximum max_ PT 
value of 14 the maximum value of 14 
Indicator of an artistic professional < APT <1, when max_PT corresponds to 
14 APT personality type that takes the maximum max_ PT 
value of 14 the maximum value of 14 
Normalized indicator of the professional 
= nay direction “Charity and voluntary work” ae 
Normalized indicator of the professional 
3 eM direction “Engineering and manufacturing” ae 
37 H Normalized indicator of the professional H 
direction “Healthcare” 
50 Ne Normalized indicator of the professional Ne 


direction “Social care” 
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Figure 4. Model of FCF-FNN 


The “User” and “Software” arrow define the data transformation mechanisms (signal system 
guidance elements). The arrows validation_split (VS), batch_size (BS), activation (AC), optimizer (OP), 
epochs (EP) are the tools for implementing the data transformation. VS determines how to regulate the 
percentage of training data to test them. BS determines the number of rows of data that will be fed to the 
model during one epoch. AC (activation function) defines the type and manner of numeric data processing. 
OP determines the optimization algorithm by which the model will work. EP defines the number of repeated 
passes of study on training data sets. Figure 5 shows an algorithm for the training process of a machine 
learning model, which reflects the internal connections between the components of an artificial intelligence 
system that allow it to “learn” from training data. 
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Figure 5. Algorithm of the machine learning model training 


3. RESULTS 

To create a data set for subsequent training, we used 35 input parameters and 29 output parameters 
obtained empirically by research input and multiplication, preserving the mathematical median of the results. 
The amount of data in the set equals 936 rows of data. Neural network training occurred in the proportion of 
10% and 90%, respectively. 

The total training time was approximately 30 minutes, with 2 approaches for 100 epochs and 1 for 
1000 epochs. The average duration of one epoch is | second of real-time, with 20 lines of data per learning 
epoch. According to the results, it is logical that the model has already finished learning by the 300th general 
epoch. However, an additional overtraining check was performed. As a result, the model did not begin to 
degrade rapidly over 700 overtime periods and was able to maintain the accuracy of the answers. Figure 6 
shows a success of training the model. 

Figure 7 shows a mistake-ratio curve on the training and test sets, which means the neural network 
works correctly, minimizing the number of mistakes after sufficient learning time. According to the empirical 
study data, a number of real data of respondents was obtained, which made it possible to find out their 
compliance with a certain professional profile. Table 2 shows two popular results, namely respondents no. 10 
and no. 35, with the following probabilities. The overall results of each respondent of the study have an ultra- 
high accuracy of giving priority professional directions because the system performs ranking according to the 
highest compliance with the values of 1. Respectively, “weak” by nature professional directions can have the 
value “number e-19”. However, in the cases similar to respondent’s no. 35, it is worth considering even 
“weak” manifestations of professional trends, because an alternative allows the individual to better 
understand themselves. 

The respondents who are still looking for a job have demonstrated that the profession they have 
obtained coincides with the results of the experiment based on artificial intelligence. According to the study 
carried out, we consider it appropriate to recommend involving a developed artificial intelligence system to 
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offering career guidance to the future university students. Besides, our research results can be used to further 
improve the knowledge and skills necessary for successful professional realization. 
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Figure 6. Diagram of the correct answers ratio on training and test sets for a period of 1000 training epochs 
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Figure 7. Diagram of the mistake ratio for a period of 1000 epochs 


Table 2. Interpretation of neural networks results 
Interpretation of neural networks results 
Professional direction profile Probability of successful professional implementation 


Respondent's identification no 


EA 94.6% 

10 EU 3.5% 
EM 0.9% 

35 CVW 99.9% 
SC 0.0000033 % 


4. CONCLUSION 

The study proposes a solution to the problem of multi-value classification for determining the 
professional adaptive capabilities of construction management staff. The performed empirical modeling for 
creating a dataset confirms the adequacy of the FCF-FNN model to solve the problem. The proposed neural 
network architecture is straightforward, defined by the coherent transmission of information from inputs to 
outputs without the use of loops. When designing the Model, 2 activation functions were used: ReLU and 
Softmax, which made it possible to rank the results of a multi-valued classification of professional areas by 
priorities in the range of values from 0 to 1. The constructed model contains 128 Dense input layers (35 input 
parameters), 1256 hidden Dense layers, with an arbitrary dropout sample reduction of 60%, 80%, 60%, and 
then the data are sent to 29 output neurons of the softmax group. The authors consider it is possible to 
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develop a neural network model for the formation of a national database that will store “digital portraits of 
the individual”, according to which employers will be able to set specific requirements for staff of 
construction management. Such “digital portraits of the staff” can serve as both a virtual summary and a 
portfolio at the same time. 
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